Almost Strong Consistency: "Good Enough" in Distributed Storage Systems

Authors

  • Hengfeng Wei
  • Yu Huang
  • Jiannong Cao
  • Jian Lu
Abstract

A consistency/latency tradeoff arises as soon as a distributed storage system replicates data. For low latency, modern storage systems often settle for weak consistency conditions, which provide little or, even worse, no guarantee of data consistency. In this paper we propose the notion of almost strong consistency as a better balance in the consistency/latency tradeoff. It provides both deterministically bounded staleness of data versions for each read and probabilistic quantification of the rate of “reading stale values”, while achieving low latency. In the context of distributed storage systems, we investigate almost strong consistency in terms of 2-atomicity. Our 2AM (2-Atomicity Maintenance) algorithm completes both reads and writes in one communication round-trip, and guarantees that each read obtains a value within the latest 2 versions. To quantify the rate of “reading stale values”, we decompose the so-called “old-new inversion” phenomenon into concurrency patterns and read-write patterns, and propose a stochastic queueing model and a timed balls-into-bins model to analyze them, respectively. The theoretical analysis not only demonstrates that “old-new inversions” rarely occur, as expected, but also reveals that the read-write pattern dominates in guaranteeing such rare data inconsistencies. These findings are further confirmed by the experimental results, showing that 2-atomicity is “good enough” in distributed storage systems by achieving low latency, bounded staleness, and rare data inconsistencies.
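
The “old-new inversion” mentioned in the abstract can be illustrated with a small simulation. The sketch below is not the authors' 2AM algorithm. It assumes a single writer, three replicas, and majority quorums, and it models a read as a single query to a majority with no write-back round. The names `Replica`, `write`, `read`, and `demo` are hypothetical and chosen only for this illustration.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    """One storage replica holding a versioned value (illustrative only)."""
    version: int = 0
    value: object = None

    def store(self, version, value):
        # Keep only the newest version this replica has seen.
        if version > self.version:
            self.version, self.value = version, value


def write(replicas, targets, version, value):
    # Deliver the write to the listed replicas. A completed write reaches
    # at least a majority; an in-flight write may have reached fewer.
    for i in targets:
        replicas[i].store(version, value)


def read(replicas, quorum):
    # One round-trip: query a majority and return the newest version seen.
    # Assumption: there is no second (write-back) round.
    newest = max((replicas[i] for i in quorum), key=lambda r: r.version)
    return newest.version, newest.value


def demo():
    replicas = [Replica() for _ in range(3)]
    write(replicas, [0, 1, 2], 1, "a")  # version 1 has completed everywhere
    write(replicas, [0], 2, "b")        # version 2 is in flight, only on replica 0
    first = read(replicas, [0, 1])      # this quorum overlaps the in-flight write
    second = read(replicas, [1, 2])     # a later read whose quorum misses it
    return first, second


if __name__ == "__main__":
    print(demo())
```

In this run the earlier read sees version 2 while the later read sees version 1, which is an old-new inversion. Under the single-writer assumption the stale read is still only one version behind, which matches the "within the latest 2 versions" bound the abstract describes.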

Similar resources

Olive: Distributed Point-in-Time Branching Storage for Real Systems

This paper describes Olive, the first distributed block storage system to provide consistent point-in-time branching. Point-in-time branching allows users to recursively and quickly snapshot or clone the storage state. It has a wide range of applications including testing new deployments or upgrades without disrupting a running system, quickly provisioning large homogeneous systems, and preservi...

DAX: A Widely Distributed Multi-tenant Storage Service for DBMS Hosting

Many applications hosted on the cloud have sophisticated data management needs that are best served by a SQL-based relational DBMS. It is not difficult to run a DBMS in the cloud, and in many cases one DBMS instance is enough to support an application’s workload. However, a DBMS running in the cloud (or even on a local server) still needs a way to persistently store its data and protect it agai...

A simple Byzantine-fault-tolerant algorithm for a multi-writer regular register

Distributed storage systems have become popular for handling the enormous amounts of data in network-centric systems. A distributed storage system provides client processes with the abstraction of a shared variable that satisfies some consistency and reliability properties. Typically the properties are ensured through a replication-based implementation. This paper presents an algorithm for a re...

Client Zone 1 Zone 2 Zone 3 Shard A Shard

Application programmers increasingly prefer distributed storage systems with strong consistency and distributed transactions (e.g., Google’s Spanner) for their strong guarantees and ease of use. Unfortunately, existing transactional storage systems are expensive to use, in part because they require costly replication protocols, like Paxos, for fault tolerance. In this p...

Granola: Low-Overhead Distributed Transaction Coordination

This paper presents Granola, a transaction coordination infrastructure for building reliable distributed storage applications. Granola provides a strong consistency model, while significantly reducing transaction coordination overhead. We introduce specific support for a new type of independent distributed transaction, which we can serialize with no locking overhead and no aborts due to write c...

Journal:
  • CoRR

Volume: abs/1507.01663

Publication date: 2015